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Abstract: After revealing difficulties of the standard time-dependent pertur- 
bation theory in quantum mechanics mainly from the viewpoint of practical 
calculation, we propose a new quasi-canonical perturbation theory. In the new 
theory, the dynamics of physical observables, instead of that of coefficients of 
wave-function expansion, is formulated so that the gauge-invariance and corre- 
spondence principles are observed naturally. 
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1 Introduction 

The standard time-dependent perturbation theory has two versions: one is for 
classical mechanics (CM) and the other for quantum mechanics (QM). The 
quantum version, proposed by Diracjl| at the early stage of QM, has been 
included as an important content in almost every textbook of QMQQ and 
employed in many papers throughout various physical fields. 

Although the two versions of the theory deal with different dynamical equa- 
tions, Hamilton's equations in CM and Schrodinger's equation in QM, they 
use the same canonical variable system and the same Hamiltonian-separation 
technique. In light of the methodology consistency, people believe that the cor- 
respondence principle, which states that QM must be consistent with CM in 
the classical limit % — ► 0, is well observed. 

During the last two decades, difficulties related to the canonical perturbation 
theory in CM revealed themselves Q-Q. In particular, LittlejohnQ indicated 
that with the use of the vector potential the standard theory mixes up the 
ordering scheme and proposed using the elegant Lie-formalism to obviate the 
difficulty. In a relatively recent paper Q], we manifestly showed that the standard 
perturbation theory in CM encounters gauge difficulties such that in numerical 
calculations errors larger than expected may involve and in theoretical analyses 
unphysical formulations may result. By employing quasi-canonical variables (or 
"pseudo-canonical" variables), we established another perturbation formalism 
in CM, which suffers from no gauge difficulties and is still cast in terms of 
"ordinary" Hamiltonian mechanics. 

In contrast with the developments in classical mechanics, the quantum ver- 
sion of the perturbation theory has not been challenged in a similar way. A 
question arises very naturally. How about the correspondence principle if we, 
on the one hand, revise the classical version of the theory and, on the other 
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hand, assume that the quantum version of the theory needs no reconsideration? 

This paper is devoted to investigating the issue. Sec. 2 briefly recalls the 
standard time-dependent perturbation theory in QM. In Sec. 3 we illustrate 
difficulties that the standard quantum perturbation theory involves. The il- 
lustration is mainly from the viewpoint of practical calculations. In Sec. 4, 
a new perturbation theory is introduced by dealing with physical variables in 
the Heisenberg picture, and the formulation in the Schrodinger picture is given 
afterwards. In Sec. 5, we give application and discussion of the new theory. 
Finally, the paper is summarized in Sec. 6. 



2 Standard Perturbation Theory 

For convenience of the later discussion, we briefly recall the standard time- 
dependent perturbation theory in QM. 

Consider a quantum system whose Hamiltonian H splits into two parts 

H = H + H 1 (1) 

where the time-independent H describes the unperturbed system and the time- 
dependent Hi represents involved perturbations. It is assumed that the unper- 
turbed system is exactly solvable, and solutions of it can be expressed by 

exp(-^e„t)*„(q) (2) 

where e n and 4 , n (q) are the eigenenergy and the eigenfunction. The basic idea 
of the standard theory lies in expanding the state function of the real system 
(described by the total Hamiltonian H) in terms of the known unperturbed 
solutions. In other words, one writes the real state function as 

*(t) - ^C„exp(-J £ „t)*„, (3) 

n 

and then try to determine the dynamics of the coefficients. If the perturbation 
is absent, the coefficients C„'s in Eq. (3) are obviously constants. When the 
perturbation Hi is taken into account, the coefficients are regarded as depending 
on time only, or, in explicit form, they can be expressed by 

C n = C n (t). (4) 

By substituting Eq. (3) into the Schrodinger equation, using the orthogonal- 
ity of the eigenfunctions, equations of motion for all individual coefficients are 
obtained as 

dC 

iK-TT = X! C k( H i)nk exp(iuj nk t), (5) 
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where aj n k = {£n — and (H\) n k is the matrix element of the perturb- 

ing Hamiltonian Hi in terms of the eigenfunctions and ^k- If the initial 
conditions 

C fe (0) = 1 and C„(0) = (if n ^ fc) (6) 

are introduced, which means that the system is in the A;-state initially, then (||) 
yields 

i 



C n {t) 



h Jo 



{Hi) nk exp(iw nfc r)dr (n k). 



(7) 



Note that in obtaining ([?]) a mathematical approximation method, the method 
of variation of constant, is applied. 

After the perturbation turns off, the system settles down again and the 
probability of finding the system in the n-state is 



Pnk = |C„( + 00)| 2 = 



+ OC 



{Hi) nk exp(iw„ fc T)(iT 



(8) 



It is quite common in textbooks to consider a microsystem perturbed by an 
electromagnetic field as a further illustration of the method 0). For instance, a 
charged particle in an atomlike system may be described by 



Ho = ^~P J 

2m 



(9) 



where Q is the charge of the particle and $o represents the unperturbed field. If 
an electromagnetic perturbation applies, the perturbing Hamiltonian takes the 
form 

^ = -^A 1 *p + ^ + Q$1 (10) 

where "*" means an symmetry operation such that for any two operators / 
and g, which may be a scalar or a vector, we have (the notation will be used 
throughout this paper) 

f*9=kf-9 + 9-h). (11) 



In such notation, (g) becomes 

Pnk = 



+°° / Q 

-— Ai *p + Q$i 



exp(?w„ fc r)dr 



(12) 



where the A^-term has been omitted as a second-order quantity. 



3 Difficulties of the Standard Theory 

The standard perturbation theory outlined in the last section, while seeming 
stringent and flawless, suffers from difficulties. One of the difficulties has some 
thing to do with the gauge transformation. 
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It is very easy to see that the resultant formula ([E 
If the gauge fields Ai, $1 are replaced by 



1 df 

A 1+ V/, 

c at 



is not gauge-invariant. 



(13) 



the expression (O) becomes 



P-nk — 



(Ai+V/)*p 

mc 



kT dT 



(14) 



Apparently, this expression can take any value. Also note that including the 
ylj-term in it provides no improvement. 

The difficulty was known long ago and many discussions appeared in the 
literature; but, oddly enough, no consensus has been reached. If different physi- 
cists in the community are asked, different answers will be obtained^. Some 
suggested that the so-called preferential gauge, in which the vector potential van- 
ishes whenever the electromagnetic perturbation is off, should be employed p0|; 
Others proposed that a certain phase factor should be introduced to the wave 
function before the whole calculation is applied [[n]]. Furthermore, there are peo- 
ple who assume that the gauge paradox aforementioned is intrinsic for quantum 
mechanics and can be solved only in quantum field theory; and there are also 
ones who believe that the gauge trouble is kind of superficial: if higher-order 
contributions, for instance from the Af-term, are carefully taken into account 
the result will turn out unique and correct. 

Though the situation is confusing and opinions are diverse, most in the 
community seem to lose interest in the subject. Related discussions disappear 
from the central circulation and only people in the "pedagogical field" still make 
fuss on it. However, we happen to have a different opinion. In our view, since the 
derivation of the standard perturbation theory, outlined in Sec. 2, seems quite 
stringent, the gauge problem aforementioned must be rather fundamental. The 
more stringent the derivation seems, the more fundamental the problem must 
be. In one of our recent paperQ, motivated partly by the desire of clarifying 
the gauge issue involved, we challenge the general validity of the principle of 
superposition. 

In this paper we confine ourselves to the perturbation theory. For this reason, 
we will, in what follows, examine the performance of the theory mainly in terms 
of practical calculations. 

Consider an atomlike system whose quantum state is specified by the quan- 
tum numbers n,l,m, which are the energy, azimuthal and magnetic quantum 
numbers respectively. (No spin is considered.) Suppose that the system is 
perturbed by a magnetic "pulse" 



Bx = eT(t)e z 



(15) 
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where T(t) is a function whose time-dependence is 

nt) = { ; \\f 1 ^. y (i6) 

w [0 (otherwise). v ' 

We may let the related vector potential take the form 

Ai = eT(t) {- V -b x + |e,) , (17) 

which is used commonly and also conforms to the preferential gauge. In terms 
of classical mechanics such perturbation will definitely affect the state of the 
system (linearly as well as nonlinearly) . If the Schrodinger equation could be 
solved exactly, the result should be qualitatively the same. 

However, the perturbation theory in QM tells us a different story. The 
perturbing Hamiltonian is, according to (JlC|) , 

eQT(t) f e 2 Q 2 T 2 (t) f 2 2 

Hl = —2^( XP » ~ VPx) + 8mc {X + V } - (18) 

Ignoring the e 2 -order nonlinear term , we arrive at 

(n,l,m\Ht\n',l',m') oc S nn >S u >S mm >. (19) 

This means that, linearly speaking, the transition probability from one state to 
another state is zero! 

If we wish to try a numerical calculation, in an attempt to include the nonlin- 
ear effects for instance, we will encounter even greater, also more fundamental, 
difficulties. The formal integration of (g) is 

C k (At) = C fc (0) - (i/ft)EC fc '(0)(fli) fcfc /At 

C k (2At) = C k (At) - (i/h) E^(A0(^i)^'e M "' At At (20) 

k' 



where k represents a set of the quantum number n,l,m. Suppose that the 
system is initially described by 

C fe (0) = l, C fc /(0) = 0(fcVfc). (21) 

The first equation of (^TJ) becomes 

C k {M) = 1 - UH x ) kk M. (22) 
h 

Since the perturbing Hamiltonian H\ = H — Hq is a self-ajoint operator, {H\) kk 
must be a real number. Expressions ( fl8| ) and (|l^) show further that the real 
number in our example is nonzero. All these lead us to 

\C k (At)\ 2 > 1, (23) 
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which is not acceptable if we take it for granted that the Schrodinger equation 
preserves the unity of the total probability. 

With the discussion above, we are convinced that there are sufficient practi- 
cal reasons for that the existing perturbation theory in QM needs to be recon- 
sidered. 

4 Quasi-Canonical Perturbation Theory 

Before presenting our new perturbation theory, we wish to offer a philosophical 
reason why we have to give up the existing perturbation theory. Coefficients 
of wave-function expansion, whose true values are not observable, do not serve 
as appropriate "objects" for a perturbation theory of the variation-of-constant 
type. Non-observable quantities are not bound up with the physical inertia of 
a quantum system; while the system undergoes only a small practical change, 
they may overreact and get prompt and large variation. (For instance, a phase 
factor depending on time and space nontrivially may involve instantly.) 

We believe that, as stressed in Ref. 7 and 8, in applying a perturbation 
method of the variation-of-constant type the two following requirements should 
be fulfilled: (i) Without perturbations, the defined variables, as "objects" of the 
perturbation theory, are true invariants, (ii) With perturbations involved, the 
defined variables remain essentially physical. Actually, failure to meet the two 
requirements has been the source of many errors and much confusion. 

For the reasons aforementioned, our theory will mainly be concerned with 
the dynamics of quantum observables. We shall mainly work in the Heisenberg 
picture, where the wave function does not change and the dynamics of physical 
quantities can be examined directly. Obviously, the features of the Heisenberg 
picture will serve our purpose extremely well. In this section, operators are 
conventionally defined in the Heisenberg picture. If necessary, the bar notation 
will be put on the head of an operator that is defined in the Schrodinger picture. 
The unitary operator e lS ^ t ' q ^ is reserved to relate the two picture such that for 
an operator u we have u = e~ lS ue lS . 

We now consider one motion invariant of a quantum system L(q, v). The 
use of L(q, p) is avoided, since such operator may represent an unphysical quan- 
tity after the time-dependent vector potential gets involved 0. Note that the 
operator L(q, v) can be constructed out of two basic operators 

q,p = q,mv+ ^A , (24) 

where q, po are canonical variables in, and only in, the unperturbed system. (In 
this sense the new theory may be regarded as a quasi-canonical one.) And, as 
almost always in quantum mechanics, L is assumed to be a polynomial of its 
basic operators q and po. In our discussion we do not consider a quantity that 
contain the third power of q and po. 
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Since we have assumed L to be a motion invariant in the unperturbed system, 
there must be 

§=(§) +{i,*.}„„ = 0, (25) 
v y q,Po 

where 

777 

H = jv 2 + Q$ . (26) 

For the later use, we rewrite ( p5| ) in terms of physical quantities. Trivially, the 
first term in the right-hand side of (^5| ) is 



dL 
~dt 



0. (27) 



q,po 



For an arbitrary operator G(q), we have 



1 FIT f)C 

{i (v), G ( q ))„„= , (28) 



Therefore, we get 

Q dL 

m dvi 



{L,Q$ } qiPo = ^_ * Eoi (29) 



The calculation of the term {L, mv 2 /2} is a bit more complex. First, we have 
{£, imu 2 } q , P0 = m{L, u,} q , Po * vf, (30) 



in which, 



m{L, i»j}q, po = {L,pj - — A oj } n>po . (31) 



By further calculations, 



Q A , Q dL dA 0j 



^~^U, P0 = ^*^ (32) 
and 

&L(q,v) Q <9A 0i 



Finally, we obtain 

{L, *.},,„ - g . , + £g . * + . ft, . - 0, (34) 

where e^; is the antisymmetric Kronecker symbol. 
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After a perturbation is applied, we have the equation of motion in the Heisen- 
berg picture as 



dL _ fdL 
~dt ~ \~dt 
'dL 

at 



{L,H}q }P 

(35) 

{i,Q$ 1 } q ,p + {L,F }q,p. 



In obtaining the equation above we have used H = H + where H is 

defined by (^6|) instead of ([)]). (see Ref. 7 for more analysis about this point.) 

We cannot let the last term in ( |35| ) vanish simply because ([|I]) . The Poisson 
brackets in ( |35| ) are defined under the system of (q, p) , which takes on the 
noncommutation relations 

{qi,Pj} = % i {Pi , Pj } = 0, (36) 



while the Poisson bracket in ([34]) is under the system of (q, po), which takes on 
the noncommutation relations 

{quPoj} = Sij, {Poi,Poj} = 0- (37) 

To make use of (|J), we should rewrite ( ^5|) in terms of observable quantities. 
By a direct calculation, we obtain 



dL\ I dL \ QdA u 

3t/ qp m dvi \ c dt 



(38) 



and 



with 



{£,Q$o} q , P = -f^*£oi (39) 



QdL_ 

m dvj 



The treatment of 



{L,Q<Z> 1 } tltP = ^-—*(-V i <S> 1 ). (40) 
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711 V 

{ L i^-}c l ,p = m{L,v i } cliP *v i (41) 



2 

is similar to that of (pOI). The velocity operator is in the situation 



v=-(p-^A -^A 1 ). (42) 
m c c 



Therefore, 



mu 2 , dL Qti ii dL . , , . 

— >- = * Vi + * (Bw + fl "> * ^ = °" (43) 
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It follows from 



dL dL Q dL Qe iit dL 

- - - * Vj H tt— * L Ql H * Hoi * Vj 

q 3 m OVi mc dvi 



dt dqj m dvi mc dv 

Q dL Qe yi dL 
— — * E u H — 3— * B lt * . 

m aw ? - mc aw, 



This equation can be regarded as the equation of motion for an observable in 
quantum mechanics. 

If the system is perturbed only after (and at) the time t, (|4|) becomes, by 
virtue of @, 

a, + Sat (45) 

at m OVi mc OVi 

where all the operators are those of the unperturbed system in the Heisenberg 
picture. In particular, v = (— i/Ti)\J — (Q/c)A.q. If desired, this equation can 
be written in terms of Poisson Bracket 

L = 0{L,$ 1 } + {q,L}*^^i 

o ° (46) 

+^ ({L, q} * {A U H } - {L, Ax} * {q, H }) , 
c 

where all the Poisson bracket is defined under the q, po-systcm. A comparison 
with the similar equation in Ref. 7 tells us that we can get ( fhi| ) simply by using 
the correspondence principle of Poisson bracket. 

Figure 1 
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Now, we assume that the perturbation makes its effect in a step-like fashion, 
shown in Fig. 1. At the beginning of each step, the system is considered as 
being in the unperturbed state and thus the variation of the observable after 
the step is 

AWfW (47) 



, dt 

where 



in which |0) stands for the initial state of the system at t = 0. 
As for the average expectation value of L at t — T, we have 



(L(T)) = L(0) 



dL 
~~dt 



dt. 



(49) 



where (dL/dt) is defined by (|48|). 

In view of the fact that motion invariants can form a complete set for a quan- 
tum system, it is, up to this point, appropriate to say that our time-dependent 
perturbation theory in quantum mechanics has been formulated completely. 



5 Application and discussion 



By Comparing (|45j), (|46|) and ( |49[ ) with counterparts in Ref. 7, we find that 
the correspondence principle, as well as the gauge-invariance principle, are well 
observed. 

It can easily be verified that if this new perturbation theory is applied in 
calculating effects of the magnetic "pulse" in Sec. 4 it will yield a much more 
reasonable nonzero result. 

As a theoretical application, we use the new perturbation theory to prove 
the ineffectiveness of the magnetic force in terms of changing energy. We define 
the energy of a quantum particle as 

e = — + $o. (50) 

Eq. (||) tells us that 

§) (!) <51) 

where (dL/db) e ^ m represent the contributions from the electric field and the 
magnetic force respectively. It is obvious that 

eijiVi * Bu * Vj = 0. (52) 

we obtain 

Note that the proof is not possible for the existing perturbation theory, in which 
effects of the electric force and the magnetic force cannot be separated. 

After obtaining the perturbation theory presented above, one could not help 
using it to further formulate the transition probability P n k, which seemed to 
be the ultimate goal of such a theory. Unfortunately, the effort failed. In 
what follows, we include the unsuccessful formulation to show that determining 
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the transition probability is beyond the scope of a perturbation theory of the 
variation-of-constant type. 

The first step of the formulation is to define a unitary operator as 

e zs = e lS e- l ^\ (54) 

where sit) is a self-adjoint operator. This is possible because the product of e lS 
and e ~ lHot / h must be unitary and a unitary operator can be expressed as an 
exponential of a self-adjoint operator^. We then assume that s is so small that 

e ls = 1 + is + ■ ■ ■ . (55) 

With this expression, we have the relation 

L(T) = e-^ T e- ls ^L(0y s ^e^ T 

RJ L(Q) + ie-^ T [L(0)s(T) - s(T)L(0)]e^ T , ! ''' ;; 



where the higher-order terms of s(T) have been omitted. Comparing ( pq ) with 
© yields 

Skk ' {T) = *(L k ~L k ,y , , 

T / ( 57 ) 

Q m \dvi c OVi J kk , 

where L k ^ L k > and (-)kk' is t ne matrix element of (— ). If the system is 
initially in the fc-state, the probability of finding it in the fc'-state is (fc ^ k') 

P kk> (T) = \(k\e^ T e^e-^ T \k')\ 2 « | Sfefc ,(T)| 2 . (58) 

Expressions ( p7| ) and ( p8| ) can indeed yield correct results sometimes. Inter- 
ested readers may use it to study the forced oscillator and compare the result 
with the exact solution in textbooks 

However, if we use the formula to calculate transition of an atom, inconsis- 
tency emerges. To see it, note that the subscript k and k' actually represents 
a set of quantum eigenvalues, which, for instance, are the energy e, the square 
of azimuthal angular momentum £ and the ^-component of angular momentum 
(. (We do not use L 2 and L z , since L has been used to represent a general 
invariant.) Note that ( pq) may take different values if we apply it to different 
motion invariants. For Q58|) to be uniquely defined, we have to, at least, show 



where 



Ses>(T) 



\s se ,(T)\ = \s cc (T)\ (59) 

e i(u k ,-uj k )T 



i(£k-£k>) 

J m \dvi c dv; / fcfc , 
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and 

e i{w k ,-uj k )T 

T , (Cfe_Cfc,) N (61) 

J Q m \dvi c dvi J kk , 

It is easy to see that, except for a uniform electric perturbation, (|5^) does not 
hold. 

By inspection of the derivation of (^?]), we find that the operator s(t, q) in 
( [54|) depends on both the time and the space, and the stringent legitimacy of 
expanding it into ([[5]) is not there. (This is similar to what happens to the 
wave- function expansion 

As we believe, to determine the value of Pkk 1 m general cases we have to 
deal with the Schrodinger equation directly. 

6 Brief summary 

In this paper, we have showed that the existing time-dependent perturbation 
theory in QM, like its counterpart in CM, suffers from difficulties and needs to 
be reconsidered. By investigating the dynamics of physical invariants, instead 
of the dynamics of coefficients of wave-function expansion, we have formulated 
a new time-dependent perturbation theory, in which the gauge-invariance prin- 
ciple and the correspondence principle are well observed. 

Discussion with Professors R. G. Littlejohn and Dongshcng Guo is gratefully 
acknowledged. This work is partly supported by the fund provided by Education 
Ministry, PRC. 
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